Computer data system data source refreshing using an update propagation graph having a merged join listener

ABSTRACT

Described are methods, systems and computer readable media for data source refreshing using an update propagation graph having a merged join listener.

This application claims the benefit of U.S. Provisional Application No. 62/549,908, entitled “COMPUTER DATA SYSTEM” and filed on Aug. 24, 2017, which is incorporated herein by reference in its entirety.

Embodiments relate generally to computer data systems, and more particularly, to methods, systems and computer readable media for data source join refreshing.

Data sources or objects within a computer data system may include static sources and dynamic sources. Some data sources or objects (e.g., tables) may depend on other data sources. As new data is received or obtained for dynamic data sources, those dynamic data sources may be refreshed (or updated). Data sources or objects that are dependent on one or more dynamic sources that have been refreshed may also need to be refreshed. The refreshing of data sources may need to be performed in an order based on dependencies to update join operations in a consistent and/or efficient manner.

Embodiments were conceived in light of the above mentioned needs, problems and/or limitations, among other things.

Some implementations (first implementations) include a system for updating a data object using update propagation graphs and merged join listeners to determine consistent join update processing. The system can include one or more hardware processors coupled to a nontransitory computer readable medium having stored thereon software instructions that, when executed by the one or more processors, cause the one or more processors to perform operations. The operations can include receiving a request to perform a join operation on a plurality of objects. The operations can also include creating an update propagation graph (UPG) for propagating updates to a result of the join operation. The operations can further include adding a merged join listener to the UPG as a child of the plurality of objects. The operations can also include adding a join result structure to the UPG as a child of the merged join listener. The operations can further include propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node. The merged join listener combining can include, for each current notification received at the merged join listener during the given update clock cycle, determining whether a priority queue includes an existing notification from the merged notification listener for the given clock cycle. The combining can also include, when the priority queue includes an existing notification from the merged notification listener for the given clock cycle, updating, responsive to the determining, the existing notification to include an additional notification based on the current notification. The combining can further include, when the priority queue does not include an existing notification from the merged notification listener for the given clock cycle, adding a new notification to the priority queue based on the current notification. The operations can also include receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.

In some first implementations, the plurality of objects are tables. In some first implementations, the join result structure is a table. In some first implementations, the UPG is directed acyclic graph (DAG). In some first implementations, the plurality of objects consists of three or more objects.

In some first implementations, the operations also include adding a second join result structure to the UPG as a second child of the merged join listener. In some first implementations, the operations further include receiving a second merged notification at the second join result structure's node of the UPG. In some first implementations, the join result structure comprises a plurality of matrices. In some first implementations, the join operation is an outer join, an inner join, and/or a cross join.

Some implementations (second implementations) can include a method that includes receiving, at a query processor, a request to perform a join operation on a plurality of objects. The method can also include creating, at the query processor, an update propagation graph (UPG) for propagating updates to a result of the join operation. The method can further include adding a merged join listener to the UPG as a child of the plurality of objects. The method can also include adding a join result to the UPG as a child of the merged join listener. The method can further include propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result node. The method can also include receiving the merged notification at the join result node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.

In some second implementations, the plurality of objects consists of a plurality of tables. In some second implementations, the UPG is a directed acyclic graph (DAG).

In some second implementations, the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification includes receiving a notification during the given update clock cycle, determining that a priority queue includes an existing notification from the merged notification listener for the given clock cycle, and updating, responsive to the determining, the existing notification to include an additional notification based on the received notification.

In some second implementations, the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification includes receiving a notification during the given update clock cycle, determining that the priority queue does not include an existing notification from the merged notification listener for the given clock cycle, and adding, responsive to the determining, a new notification from the merged notification listener to the priority queue based on the received notification.

In some second implementations, the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification further includes receiving a second notification during the given update clock cycle, determining that the priority queue includes an existing notification from the merged notification listener for the given clock cycle, the existing notification being the new notification, and updating, responsive to the determining, the existing notification to include an additional notification based on the second notification.

In some second implementations, the method also includes adding a second join result structure to the UPG as a second child of the merged join listener. In some second implementations, the method further includes receiving a second merged notification at the second join result structure's node of the UPG. In some second implementations, the second merged notification is a copy of the merged notification. In some second implementations, the join result structure comprises a plurality of matrices. In some second implementations, the join operation can be an outer join, an inner join, and/or a cross join.

Some implementations (third implementations) include a nontransitory computer readable medium having stored thereon software instructions that, when executed by one or more processors, cause the one or more processors to perform operations. The operations can include receiving a request to perform a join operation on a plurality of objects. The operation can also include creating an update propagation graph (UPG) for propagating updates to a result of the join operation. The operations can further include adding a merged join listener to the UPG as a child of the plurality of objects. The operations can also include adding a join result structure to the UPG as a child of the merged join listener. The operations can further include propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node. The operations can also include receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.

In some third implementations, the plurality of objects are tables and the join result structure is a table. In some third implementations, the UPG is a directed acyclic graph (DAG). In some third implementations, the plurality of objects consists of three or more objects. In some third implementations, the operations also include adding a second join result structure to the UPG as a second child of the merged join listener.

In some third implementations, the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification includes, for each current notification received at the merged join listener during the given update clock cycle: determining whether a priority queue includes an existing notification from the merged notification listener for the given clock cycle. In such third implementations, the combining also includes, when the priority queue includes an existing notification from the merged notification listener for the given clock cycle, updating, responsive to the determining, the existing notification to include an additional notification based on the current notification. In such third implementations, the combining further includes, when the priority queue does not include an existing notification from the merged notification listener for the given clock cycle, adding a new notification to the priority queue based on the current notification.

In some third implementations, the join result structure comprises a plurality of matrices. In some third implementations, the join operation can be an outer join, an inner join, and/or a cross join.

Some implementations (fourth implementations) include a system that includes one or more hardware processors coupled to a nontransitory computer readable medium having stored thereon software instructions that, when executed by the one or more processors, cause the one or more processors to perform operations. The operations can include receiving a request to perform a join operation on a plurality of data sources. The operations can also include creating an update propagation graph (UPG) for propagating updates to a result of the join operation. The operations can further include adding a merged join listener to the UPG as a child of the plurality of objects. The operations can also include adding a join result structure to the UPG as a child of the merged join listener. The operations can further include propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node. The operations can also include receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.

In some fourth implementations, the join result structure is a table. In some fourth implementations, the UPG is a directed acyclic graph (DAG). In some fourth implementations, the plurality of data sources consists of three or more data sources.

In some fourth implementations, the operations also include adding a second join result structure to the UPG as a second child of the merged join listener. In some fourth implementations, the operations further include receiving a second merged notification at the second join result structure's node of the UPG. In some fourth implementations, the join result structure comprises a plurality of matrices. In some fourth implementations, the join operation can be outer join, an inner join, and/or a cross join.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram of an example computer data system showing an example data distribution configuration in accordance with some implementations.

FIG. 2 is a diagram of an example computer data system showing an example administration/process control arrangement in accordance with some implementations.

FIG. 3 is a diagram of an example computing device configured for GUI control element processing in accordance with some implementations.

FIGS. 4A and 4B show data source definitions and a corresponding directed acyclic graph (DAG) in accordance with some implementations.

FIGS. 5A and 5B show data source definitions and a corresponding DAG in accordance with some implementations.

FIG. 5C is a table showing the priorities assigned to the nodes of the DAG in FIG. 5B in accordance with some implementations.

FIG. 6 is a flowchart of an example data source refresh process in accordance with some implementations.

FIG. 7 is a flowchart of an example merged join listener process in accordance with some implementations.

FIG. 8 is a flowchart showing example data source refresh processing of the DAG of FIG. 5B using a merged join listener and a priority queue, in accordance with some implementations.

FIG. 9 is a flowchart of an example data source refresh process using an update propagation graph having a merged join listener process in accordance with some implementations.

FIG. 10 is a flowchart of an example data source refresh process using an update propagation graph having a merged join listener process in accordance with some implementations.

FIGS. 11A and 11B show data source definitions and a corresponding DAG in accordance with some implementations.

FIG. 11C shows a DAG corresponding to the data source definitions shown in FIG. 11A in accordance with some implementations.

DETAILED DESCRIPTION

Reference may be made herein to the Java programming language, Java classes, Java bytecode and the Java Virtual Machine (JVM) for purposes of illustrating example implementations. It will be appreciated that implementations can include other programming languages (e.g., groovy, Scala, R, Go, etc.), other programming language structures as an alternative to or in addition to Java classes (e.g., other language classes, objects, data structures, program units, code portions, script portions, etc.), other types of bytecode, object code and/or executable code, and/or other virtual machines or hardware implemented machines configured to execute a data system query.

FIG. 1 is a diagram of an example computer data system and network 100 showing an example data distribution configuration in accordance with some implementations. In particular, the system 100 includes an application host 102, a periodic data import host 104, a query server host 106, a long-term file server 108, and a user data import host 110. While tables are used as an example data object in the description below, it will be appreciated that the data system described herein can also process other data objects such as mathematical objects (e.g., a singular value decomposition of values in a given range of one or more rows and columns of a table), TableMap objects, etc. A TableMap object provides the ability to lookup a Table by some key. This key represents a unique value (or unique tuple of values) from the columns aggregated on in a byExternal( ) statement execution, for example. A TableMap object can be the result of a byExternal( ) statement executed as part of a query. It will also be appreciated that the configurations shown in FIGS. 1 and 2 are for illustration purposes and in a given implementation each data pool (or data store) may be directly attached or may be managed by a file server.

The application host 102 can include one or more application processes 112, one or more log files 114 (e.g., sequential, row-oriented log files), one or more data log tailers 116 and a multicast key-value publisher 118. The periodic data import host 104 can include a local table data server, direct or remote connection to a periodic table data store 122 (e.g., a column-oriented table data store) and a data import server 120. The query server host 106 can include a multicast key-value subscriber 126, a performance table logger 128, local table data store 130 and one or more remote query processors (132, 134) each accessing one or more respective tables (136, 138). The long-term file server 108 can include a long-term data store 140. The user data import host 110 can include a remote user table server 142 and a user table data store 144. Row-oriented log files and column-oriented table data stores are discussed herein for illustration purposes and are not intended to be limiting. It will be appreciated that log files and/or data stores may be configured in other ways. In general, any data stores discussed herein could be configured in a manner suitable for a contemplated implementation.

In operation, the input data application process 112 can be configured to receive input data from a source (e.g., a securities trading data source), apply schema-specified, generated code to format the logged data as it's being prepared for output to the log file 114 and store the received data in the sequential, row-oriented log file 114 via an optional data logging process. In some implementations, the data logging process can include a daemon, or background process task, that is configured to log raw input data received from the application process 112 to the sequential, row-oriented log files on disk and/or a shared memory queue (e.g., for sending data to the multicast publisher 118). Logging raw input data to log files can additionally serve to provide a backup copy of data that can be used in the event that downstream processing of the input data is halted or interrupted or otherwise becomes unreliable.

A data log tailer 116 can be configured to access the sequential, row-oriented log file(s) 114 to retrieve input data logged by the data logging process. In some implementations, the data log tailer 116 can be configured to perform strict byte reading and transmission (e.g., to the data import server 120). The data import server 120 can be configured to store the input data into one or more corresponding data stores such as the periodic table data store 122 in a column-oriented configuration. The periodic table data store 122 can be used to store data that is being received within a time period (e.g., a minute, an hour, a day, etc.) and which may be later processed and stored in a data store of the long-term file server 108. For example, the periodic table data store 122 can include a plurality of data servers configured to store periodic securities trading data according to one or more characteristics of the data (e.g., a data value such as security symbol, the data source such as a given trading exchange, etc.).

The data import server 120 can be configured to receive and store data into the periodic table data store 122 in such a way as to provide a consistent data presentation to other parts of the system. Providing/ensuring consistent data in this context can include, for example, recording logged data to a disk or memory, ensuring rows presented externally are available for consistent reading (e.g., to help ensure that if the system has part of a record, the system has all of the record without any errors), and preserving the order of records from a given data source. If data is presented to clients, such as a remote query processor (132, 134), then the data may be persisted in some fashion (e.g., written to disk).

The local table data server 124 can be configured to retrieve data stored in the periodic table data store 122 and provide the retrieved data to one or more remote query processors (132, 134) via an optional proxy.

The remote user table server (RUTS) 142 can include a centralized consistent data writer, as well as a data server that provides processors with consistent access to the data that it is responsible for managing. For example, users can provide input to the system by writing table data that is then consumed by query processors.

The remote query processors (132, 134) can use data from the data import server 120, local table data server 124 and/or from the long-term file server 108 to perform queries. The remote query processors (132, 134) can also receive data from the multicast key-value subscriber 126, which receives data from the multicast key-value publisher 118 in the application host 102. The performance table logger 128 can log performance information about each remote query processor and its respective queries into a local table data store 130. Further, the remote query processors can also read data from the RUTS, from local table data written by the performance logger, or from user table data read over NFS, for example.

It will be appreciated that the configuration shown in FIG. 1 is a typical example configuration that may be somewhat idealized for illustration purposes. An actual configuration may include one or more of each server and/or host type. The hosts/servers shown in FIG. 1 (e.g., 102-110, 120, 124 and 142) may each be separate or two or more servers may be combined into one or more combined server systems. Data stores can include local/remote, shared/isolated and/or redundant. Any table data may flow through optional proxies indicated by an asterisk on certain connections to the remote query processors. Also, it will be appreciated that the term “periodic” is being used for illustration purposes and can include, but is not limited to, data that has been received within a given time period (e.g., millisecond, second, minute, hour, day, week, month, year, etc.) and which has not yet been stored to a long-term data store (e.g., 140).

FIG. 2 is a diagram of an example computer data system 200 showing an example administration/process control arrangement in accordance with some implementations. The system 200 includes a production client host 202, a controller host 204, a GUI host or workstation 206, and query server hosts 208 and 210. It will be appreciated that there may be one or more of each of 202-210 in a given implementation.

The production client host 202 can include a batch query application 212 (e.g., a query that is executed from a command line interface or the like) and a real time query data consumer process 214 (e.g., an application that connects to and listens to tables created from the execution of a separate query). The batch query application 212 and the real time query data consumer 214 can connect to a remote query dispatcher 222 and one or more remote query processors (224, 226) within the query server host 1 208.

The controller host 204 can include a persistent query controller 216 configured to connect to a remote query dispatcher 232 and one or more remote query processors 228-230. In some implementations, the persistent query controller 216 can serve as the “primary client” for persistent queries and can request remote query processors from dispatchers, and send instructions to start persistent queries. For example, a user can submit a query to 216, and 216 starts and runs the query every day. In another example, a securities trading strategy could be a persistent query. The persistent query controller can start the trading strategy query every morning before the market opened, for instance. It will be appreciated that 216 can work on times other than days. In some implementations, the controller may require its own clients to request that queries be started, stopped, etc. This can be done manually, or by scheduled (e.g., cron) jobs. Some implementations can include “advanced scheduling” (e.g., auto-start/stop/restart, time-based repeat, etc.) within the controller.

The GUI/host workstation can include a user console 218 and a user query application 220. The user console 218 can be configured to connect to the persistent query controller 216. The user query application 220 can be configured to connect to one or more remote query dispatchers (e.g., 232) and one or more remote query processors (228, 230).

FIG. 3 is a diagram of an example computing device 300 in accordance with at least one implementation. The computing device 300 includes one or more processors 302, operating system 304, computer readable medium 306 and network interface 308. The memory 306 can include a data source refresh application 310 and a data section 312 (e.g., for storing DAGs, etc.).

In operation, the processor 302 may execute the application 310 stored in the memory 306. The application 310 can include software instructions that, when executed by the processor, cause the processor to perform operations for data source refreshing in accordance with the present disclosure (e.g., performing one or more of 602-622, 702-718, 802-812, 902-920, and/or 1002-1012 described below).

The application program 310 can operate in conjunction with the data section 312 and the operating system 304.

FIGS. 4A and 4B show data source definitions and a corresponding directed acyclic graph (DAG) in accordance with some implementations. In FIG. 4A, example code defines the data sources as tables (t1-t3). From the code for the data sources, a DAG can be generated as shown by the graph in FIG. 4B. The DAG in FIG. 4B shows dependencies between the nodes, which correspond to table data sources.

Data sources can include market data (e.g., data received via multicast distribution mechanism or through a tailer), system generated data, historical data, user input data from the remote user table server, tables programmatically generated in-memory, or something further upstream in the DAG. In general, anything represented in the data system (e.g., an object, a table) and which can refresh itself/provide data can be a data source. Also, data sources can include non-table data structures which update, for example, mathematical data structures. As shown in FIG. 4A, t3=t1.someOp2( ), where “t1.someOp2( )” represents an operation performed with respect to t1 such as, for example, “t1.svd( )”, where this takes the singular value decomposition of table t1. The SVD would then get updated when t1 changes. The SVD can be represented as a collection of matrices and is an example of a non-tabular result. Similarly, correlation matrices, linear algebra, PDE solvers, a non-matrix, non-tabular data object, etc. can be supported.

In some implementations, code can be converted into the in-memory data structures holding the DAG. For example, the source code of FIG. 4A gets converted into the DAG data structure in memory. The DAG connectivity can change by executing code. For example, assume a set of code CODE1 is executed. CODE1 leads to a DAG1 being created. Data can be processed through DAG1, leading to table updates. Now assume that the user wants to compute a few more tables. The user can run a few more lines of code CODE2, which use variables computed in CODE1. The execution of CODE2 leads to a change in the DAG. As a simple example, assume that the first 3 lines in FIG. 4A are executed. The user could come along later and execute line 4, which would modify the DAG data structure. Also, some implementations can permit other programs to listen to changes from a node representing a data object (e.g., table or non-table object) or an internal node.

In some implementations, when a table changes, an application programming interface (API) can specify rows where add, modify, delete, or reindex (AMDR) changes were made. A reindex is a change in which a row is moved but the value contained in the row is not modified. The API can also provide a mechanism to obtain a value prior to the most recent change. When the DAG is processed during the refresh, the AMD info on “upstream” data objects (e.g., tables, etc.) or nodes is used to compute changes in “downstream” data objects or nodes. In some implementations, the entire DAG can be processed during the refresh cycle.

In general, a DAG can be comprised of a) dynamic nodes (DN); b) static nodes (SN); and c) internal nodes (IN) that can include nodes with DN and/or SN and/or IN as inputs.

DNs are nodes of the graph that can change. For example, DN can be data sources that update as new data comes in. DN could also be timers that trigger an event based on time intervals. In other examples, DN could also be MySQL monitors, specialized filtering criteria (e.g., update a “where” filter only when a certain event happens). Because these nodes are “sources”, they may occur as root nodes in the DAG. At the most fundamental level, DN are root DAG nodes which change (e.g., are “alive”).

SNs are nodes of the DAG that do not change. For example, historical data does not change. IN are interior nodes of the DAG. The state of an IN can be defined by its inputs, which can be DN, SN, and or IN. If all of the IN inputs are “static”, the IN will be static. If one or more of the IN inputs is “dynamic”, the IN will be dynamic. IN can be tables or other data structures. For example, a “listener IN” can permit code to listen to a node of the DAG. A listener node or associated listener monitoring code can place (or “fire”) additional events (or notifications) into a priority queue of a DAG.

In general, a DAG can be composed of static and/or dynamic subgraphs. Update processing occurs on dynamic subgraphs (because static subgraphs are not changing). Only dynamic nodes are in the DataMonitor loop. For Tables, AMDR messages are used for communication within the DAG.

When query code is executed, the DAG is created or modified. As part of this process, the system records the order in which the DAG nodes were constructed in. This “construction ordering” can be used to determine the order that nodes are processed in the DAG.

For example, consider:

a=db.i( . . . ), where a is a dynamic node (or DN)

b=a.where(“A=1”)

c=b.where(“B=2”)

d=c.join(b)

Assume (a) has changes to be processed during a refresh cycle. The order of processing will be (a), (b), (c), and then (d).

When (d) is processed, it will process input changes from both (b) and (c) before creating AMDRs notification messages for (d). This ordering prevents (d) from creating more than one set of AMDRs per input change, and it can help ensure that all AMDRs are consistent with all data being processed for the clock cycle. If this ordering were not in place, it may be possible to get multiple ticks per cycle and some of the data can be inconsistent. Also, the ordering can help ensure that joins produce consistent results.

FIGS. 5A and 5B show data source definitions and a corresponding DAG in accordance with some implementations. In particular, the code of FIG. 5A defines the data sources as tables (t1-t4). From the code for the data sources, a DAG can be generated as shown by the graph in FIG. 5B. The DAG in FIG. 5B shows dependencies between the nodes, which correspond to table data sources. In particular, FIG. 5B shows an example in which a merged join listener MJL is added to the DAG when the join operation (the fourth line of code) in FIG. 5A is executed.

FIG. 5C is a table showing the priorities assigned to the nodes of the DAG in FIG. 5B in accordance with some implementations. In some embodiments, the system records the order in which the DAG nodes were constructed in and uses this “construction ordering” to determine node priorities. In some embodiments, the system can assign the priorities of the nodes after the DAG has been constructed based on the arrangement of nodes within the DAG. The priorities are can be used to determine the order that nodes are processed in the DAG as discussed below (e.g., at 702 in FIG. 7, and at 802-812 in FIG. 8).

It will be appreciated that a join operation can have two or more inputs and one or more outputs, and when generating a DAG a merged join listener node in the DAG can have two or more parent nodes (the two or more inputs to the join operation) and one or more child nodes (the one or more outputs/results of the join operation). For example, in some embodiments, by processing notifications through the DAG based on priorities (e.g., the construction ordering or priority as shown, for example, in FIG. 5C) a merged join listener can process notifications from two or more parent nodes as shown in FIGS. 6 and 7, and discussed below.

FIG. 6 is a flowchart of an example data source refresh process in accordance with some implementations. Processing begins at 602, where a refresh loop for each data source begins. Processing continues to 604.

At 604, the system determines whether the data source has been (or should be) garbage collected. In some implementations, a DAG can use garbage collection to determine when parts of the evolving DAG are no longer needed. This can help ensure referential integrity. To accomplish this, nodes upstream have hard links, while nodes downstream have weak links. As a result of this, children prevent parents from being garbage collected, but parents allow children to be garbage collected. Processing continues to 606.

At 606, a logical clock value (or count) is incremented. In some implementations, the clock has a few uses. One example use is to control how the data is bunched and processed. The logical clock may be used to determine whether data may have changed for producing asynchronous consistent snapshots. Another example is that the logical clock may indicate whether certain data is up-to-date or needs to be recomputed. Processing continues to 608.

At 608, the logical clock state is set to updating. The updating state of the logical clock can be a signal to indicate that an update or refresh cycle of the data sources is in progress. Processing continues to 610.

At 610, a refresh method is called for the data source. Processing continues to 612.

At 612, the system determines whether the priority queue is empty. The priority queue can include a data type similar to a regular queue or stack data structure, where each element has a “priority” associated with it. In a priority queue, an element with high priority is served before an element with low priority. If two elements have the same priority, they are served according to their order in the queue. In some implementations, priority can be based upon DAG position. AMDR messages can include concise summaries of what changed in a table (e.g., data added, modified, deleted, or reordered). The AMDR messages also allow the values at the previous clock cycle to be seen. If the queue is not empty, processing continues to 614. Otherwise, processing continues to 616.

At 614, the next notification from the queue is delivered. Processing continues back to 612.

At 616, the logical clock state is set to idle to indicate the end of the refreshing for this data source. Processing continues to 618.

At 618, terminal notifications are delivered. In some implementations, terminal notifications can include notifications that (1) are processed last and (2) don't have side effects on other nodes and/or data sources. Processing continues to 620.

At 620, the system sleeps. Some implementations can include fixed period clock cycles (e.g., 1 second), but other strategies can work. Some implementations may not sleep, but rather immediately go on and process the next bunch of data. Processing continues to 622.

At 622, the system moves to the next data source and processing continues to 602.

FIG. 7 is a flowchart of an example merged join listener process 700 in accordance with some implementations. Processing begins at 702, where the next notification message in the priority queue to be delivered is determined based on priority. For example, some embodiments use a priority queue such as priority queue 814 shown in FIG. 8 and discussed below. to Processing continues to 704.

At 704, notification A is delivered to destination node. Processing continues to 706.

At 706, it is determined whether the destination node is a merged join listener. If so, processing continues to 710, otherwise processing continues to 708.

At 708, the destination node processes the notification. Optionally, the destination node can add new notifications to the queue. Processing continues to 718.

At 710, the merged join listener determines whether it should update an existing notification for the join result already in the queue or add a new notification to the queue. Processing continues to 712.

At 712, if the merged join listener determines that it should update an existing notification for the join result already in the queue, then processing continues to 714, otherwise processing continues to 716.

At 714, an existing notification in the queue is updated to include the changes indicated by notification A. Processing continues to 718.

At 716, a new notification for the join result is added to the queue based on notification A. Processing continues to 718.

At 718, the system determines whether the priority queue is empty. If it is empty, processing continues to 616, as shown in FIG. 6 and described above. Otherwise, processing continues to 702.

It will be appreciated that process 700 can be repeated in whole or in part to, for example, continue processing updates through the DAG.

FIG. 8 is a flowchart showing example data source refresh processing 800 of the DAG of FIG. 5B using a merged join listener and a priority queue 814, in accordance with some implementations. Processing begins at 802, where source table t1 ticks, inserting notifications 816 and 818 into priority queue 814 for its children (t2 and t3) in the DAG. Processing continues to 804.

At 804, derived table t2 receives notification 816, processes it, and inserts a notification 820 into priority queue 814 for its child (MJL) in the DAG. Processing continues to 806.

At 806, derived table t3 receives notification 818, processes it, and inserts a notification 822 into priority queue 814 for its child (MJL) in the DAG. Processing continues to 808.

At 808, merged join listener MJL receives notification 820, processes it, and inserts notification 824 into queue 814 for its child (t4) in the DAG. Processing continues to 810.

At 810, merged join listener MJL receives notification 822, processes it, and updates existing notification 824 already in queue 814 for its child (t4) in the DAG. Processing continues to 812.

At 812, join result t4 receives merged notification 824 which includes updates for both sides of the join in the same notification and processes the notification to determine a consistent join result. The merged join listener MJL ensures that join result t4 receives notifications from all sides of the join in the same merged notification, thereby allowing join result t4 to safely determine a consistent join result each time it receives a notification.

It will be appreciated that process 800 can be repeated to, for example, process another update through the DAG. It will also be appreciated that in some examples, less than all sides of the join insert a notification into the priority queue and in such examples the merged join listener will merge the notification(s) from those side(s) of the join that generated a notification into one merged notification to be delivered to the join result.

FIG. 9 is a flowchart of an example data source refresh process 900 using an update propagation graph having a merged join listener in accordance with some implementations. Processing begins at 902, where, a request to perform a join operation on two or more objects (e.g., tables) is received. Processing continues to 904.

At 904, an update propagation graph such as, for example, a directed acyclic graph (DAG) is created for propagating updates to a result of the join operation. Processing continues to 906.

At 906, the system determines whether two or more of the tables share a common ancestor. Processing continues to 908.

At 908, if two or more of the tables shared common ancestor processing continues to 910, otherwise processing continues to 916.

At 910, a merged join listener is added to the DAG as a child of the two or more tables. Processing continues to 912.

At 912, a join result table is added to the DAG as a child of the merged join listener. Processing continues to 914.

At 914, updates are propagated through the DAG, including the merged join listener combining all update notifications from the two or more tables for a given update clock cycle into one merged notification and delivering the merged notification to the join result table node of the DAG during that clock cycle to provide a consistent representation of the updates to all sides of the join operation to the join result table node. Processing continues to 920.

At 916, a join result table is added to the DAG as a child of the two or more tables. Processing continues to 918.

At 918, updates are propagated through the DAG.

At 920, a notification is received at the join result table and the changes are applied to update the result of the join operation in a consistent manner without having to re-execute the full join operation. If any two sides of the join are based on a common source, the merge join listener ensures that the join result receives a merged notification that includes all the notifications from all of the sides of the join that have generated a notification. In some embodiments, the notifications can be AMDR messages and the merged notification can include multiple AMDR messages combined into one notification.

It will be appreciated that process 900 can be repeated in whole or in part to, for example, process additional updates through the DAG.

FIG. 10 is a flowchart of an example data source refresh process 1000 using an update propagation graph having a merged join listener in accordance with some implementations. Processing begins at 1002, where a request to perform a join operation on two or more objects (e.g., data sources, tables or non-tabular structures, etc.) is received. Processing continues to 1004.

At 1004, an update propagation graph (UPG) such as, for example, a directed acyclic graph (DAG), is created for propagating updates to a result of the join operation. The join operation can be an operation that operates on two or more inputs and produces one or more outputs. For example, the join operation can be an outer join, an inner join, or a cross join. Processing continues to 1006.

At 1006, a merged join listener is added to the UPG as a child of the two or more objects. Processing continues to 1008.

At 1008, a join result structure is added to the UPG as a child of the merged join listener. Processing continues to 1010. For example, the join result structure can be an object such as a table or a non-tabular structure (e.g., a collection of matrices in the case of an SVD operation, as discussed above).

At 1010, updates are propagated through the UPG, including the merged join listener combining update notifications from the two or more objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of the updates to all sides of the join operation to the join result structure node. For example, the merged join listener can combine all update notifications from the two or more objects for a given update clock cycle into one merged notification.

In some embodiments, the combining can include, for each current notification received at the merged join listener during the given update clock cycle, determining whether a priority queue includes an existing notification from the merged notification listener for the given clock cycle; and when it does, updating the existing notification to include an additional notification based on the current notification; and when it doesn't, adding a new notification to the priority queue based on the current notification. Processing continues to 1012.

At 1012, the merged notification is received at the join result table and the changes indicated therein are applied to update the result of the join operation in a consistent manner without having to re-execute the full join operation. In embodiments, when any two sides of the join are based on a common source, the merge join listener ensures that the join result receives one merged notification per logical clock cycle that includes all the notifications from all of the sides of the join that have generated a notification for that logical clock cycle, thereby enabling the system to update the result of the join operation in a consistent manner without having to re-execute the full join operation.

It will be appreciated that process 1000 can be repeated in whole or in part to, for example, process additional updates through the DAG.

It will also be appreciated that, although not shown, process 1000 can include adding a second join result object to the DAG.

In some embodiments, the notifications can be AMDR messages and the merged notification can include multiple AMDR messages combined into one notification.

FIGS. 11A and 11B show data source definitions and a corresponding DAG in accordance with some implementations. As shown in FIG. 11B, join results t4 and t5 share a common merged join listener MJL_1. In operation, the system can, in some embodiments, determine that two join results share the same ancestors and in such cases generate the DAG such that those two join results share a single shared merge join listener. In some embodiments, two merged notification listeners can be used as shown, for example, in FIG. 11C.

FIG. 11C shows a DAG corresponding to the data source definitions shown in FIG. 11A in accordance with some implementations. The DAG in FIG. 11C includes two separate merged join listeners MJL_1 and MJL_2 for join results t4 and t5, respectively, which share the same ancestors.

It will be appreciated that the modules, processes, systems, and sections described above can be implemented in hardware, hardware programmed by software, software instructions stored on a nontransitory computer readable medium or a combination of the above. A system as described above, for example, can include a processor configured to execute a sequence of programmed instructions stored on a nontransitory computer readable medium. For example, the processor can include, but not be limited to, a personal computer or workstation or other such computing system that includes a processor, microprocessor, microcontroller device, or is comprised of control logic including integrated circuits such as, for example, an Application Specific Integrated Circuit (ASIC), a field programmable gate array (FPGA) or the like. The instructions can be compiled from source code instructions provided in accordance with a programming language such as Java, C, C++, C#.net, assembly or the like. The instructions can also comprise code and data objects provided in accordance with, for example, the Visual Basic™ language, a specialized database query language, or another structured, object-oriented or other programming language. The sequence of programmed instructions, or programmable logic device configuration software, and data associated therewith can be stored in a nontransitory computer-readable medium such as a computer memory or storage device which may be any suitable memory apparatus, such as, but not limited to ROM, PROM, EEPROM, RAM, flash memory, disk drive and the like.

Furthermore, the modules, processes systems, and sections can be implemented as a single processor or as a distributed processor. Further, it should be appreciated that the steps mentioned above may be performed on a single or distributed processor (single and/or multi-core, or cloud computing system). Also, the processes, system components, modules, and sub-modules described in the various figures of and for embodiments above may be distributed across multiple computers or systems or may be co-located in a single processor or system. Example structural embodiment alternatives suitable for implementing the modules, sections, systems, means, or processes described herein are provided below.

The modules, processors or systems described above can be implemented as a programmed general purpose computer, an electronic device programmed with microcode, a hard-wired analog logic circuit, software stored on a computer-readable medium or signal, an optical computing device, a networked system of electronic and/or optical devices, a special purpose computing device, an integrated circuit device, a semiconductor chip, and/or a software module or object stored on a computer-readable medium or signal, for example.

Embodiments of the method and system (or their sub-components or modules), may be implemented on a general-purpose computer, a special-purpose computer, a programmed microprocessor or microcontroller and peripheral integrated circuit element, an ASIC or other integrated circuit, a digital signal processor, a hardwired electronic or logic circuit such as a discrete element circuit, a programmed logic circuit such as a PLD, PLA, FPGA, PAL, or the like. In general, any processor capable of implementing the functions or steps described herein can be used to implement embodiments of the method, system, or a computer program product (software program stored on a nontransitory computer readable medium).

Furthermore, embodiments of the disclosed method, system, and computer program product (or software instructions stored on a nontransitory computer readable medium) may be readily implemented, fully or partially, in software using, for example, object or object-oriented software development environments that provide portable source code that can be used on a variety of computer platforms. Alternatively, embodiments of the disclosed method, system, and computer program product can be implemented partially or fully in hardware using, for example, standard logic circuits or a VLSI design. Other hardware or software can be used to implement embodiments depending on the speed and/or efficiency requirements of the systems, the particular function, and/or particular software or hardware system, microprocessor, or microcomputer being utilized. Embodiments of the method, system, and computer program product can be implemented in hardware and/or software using any known or later developed systems or structures, devices and/or software by those of ordinary skill in the applicable art from the function description provided herein and with a general basic knowledge of the software engineering and computer networking arts.

Moreover, embodiments of the disclosed method, system, and computer readable media (or computer program product) can be implemented in software executed on a programmed general purpose computer, a special purpose computer, a microprocessor, or the like.

It is, therefore, apparent that there is provided, in accordance with the various embodiments disclosed herein, methods, systems and computer readable media for data source refreshing.

Application Ser. No. 15/154,974, entitled “DATA PARTITIONING AND ORDERING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,975, entitled “COMPUTER DATA SYSTEM DATA SOURCE REFRESHING USING AN UPDATE PROPAGATION GRAPH” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,979, entitled “COMPUTER DATA SYSTEM POSITION-INDEX MAPPING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,980, entitled “SYSTEM PERFORMANCE LOGGING OF COMPLEX REMOTE QUERY PROCESSOR QUERY OPERATIONS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,983, entitled “DISTRIBUTED AND OPTIMIZED GARBAGE COLLECTION OF REMOTE AND EXPORTED TABLE HANDLE LINKS TO UPDATE PROPAGATION GRAPH NODES” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,984, entitled “COMPUTER DATA SYSTEM CURRENT ROW POSITION QUERY LANGUAGE CONSTRUCT AND ARRAY PROCESSING QUERY LANGUAGE CONSTRUCTS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,985, entitled “PARSING AND COMPILING DATA SYSTEM QUERIES” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,987, entitled “DYNAMIC FILTER PROCESSING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,988, entitled “DYNAMIC JOIN PROCESSING USING REAL-TIME MERGED NOTIFICATION LISTENER” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,990, entitled “DYNAMIC TABLE INDEX MAPPING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,991, entitled “QUERY TASK PROCESSING BASED ON MEMORY ALLOCATION AND PERFORMANCE CRITERIA” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,993, entitled “A MEMORY-EFFICIENT COMPUTER SYSTEM FOR DYNAMIC UPDATING OF JOIN PROCESSING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,995, entitled “QUERY DISPATCH AND EXECUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,996, entitled “COMPUTER DATA DISTRIBUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,997, entitled “DYNAMIC UPDATING OF QUERY RESULT DISPLAYS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,998, entitled “DYNAMIC CODE LOADING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,999, entitled “IMPORTATION, PRESENTATION, AND PERSISTENT STORAGE OF DATA” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,001, entitled “COMPUTER DATA DISTRIBUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,005, entitled “PERSISTENT QUERY DISPATCH AND EXECUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,006, entitled “SINGLE INPUT GRAPHICAL USER INTERFACE CONTROL ELEMENT AND METHOD” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,007, entitled “GRAPHICAL USER INTERFACE DISPLAY EFFECTS FOR A COMPUTER DISPLAY SCREEN” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,009, entitled “COMPUTER ASSISTED COMPLETION OF HYPERLINK COMMAND SEGMENTS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,010, entitled “HISTORICAL DATA REPLAY UTILIZING A COMPUTER SYSTEM” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,011, entitled “DATA STORE ACCESS PERMISSION SYSTEM WITH INTERLEAVED APPLICATION OF DEFERRED ACCESS CONTROL FILTERS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,012, entitled “REMOTE DATA OBJECT PUBLISHING/SUBSCRIBING SYSTEM HAVING A MULTICAST KEY-VALUE PROTOCOL” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/351,429, entitled “QUERY TASK PROCESSING BASED ON MEMORY ALLOCATION AND PERFORMANCE CRITERIA” and filed in the United States Patent and Trademark Office on Nov. 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/813,142, entitled “COMPUTER DATA SYSTEM DATA SOURCE HAVING AN UPDATE PROPAGATION GRAPH WITH FEEDBACK CYCLICALITY” and filed in the United States Patent and Trademark Office on Nov. 14, 2017, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/813,127 entitled “COMPUTER DATA DISTRIBUTION ARCHITECTURE CONNECTING AN UPDATE PROPAGATION GRAPH THROUGH MULTIPLE REMOTE QUERY PROCESSORS” and filed in the United States Patent and Trademark Office on Nov. 14, 2017, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/813,119, entitled “KEYED ROW SELECTION” and filed in the United States Patent and Trademark Office on Nov. 14, 2017, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

While the disclosed subject matter has been described in conjunction with a number of embodiments, it is evident that many alternatives, modifications and variations would be, or are, apparent to those of ordinary skill in the applicable arts. Accordingly, Applicants intend to embrace all such alternatives, modifications, equivalents and variations that are within the spirit and scope of the disclosed subject matter. 

What is claimed is:
 1. A system for updating a data object using update propagation graphs and merged join listeners to determine consistent join update processing, the system comprising: one or more hardware processors coupled to a nontransitory computer readable medium having stored thereon software instructions that, when executed by the one or more processors, cause the one or more processors to perform operations including: receiving a request to perform a join operation on a plurality of objects; creating an update propagation graph (UPG) for propagating updates to a result of the join operation; adding a merged join listener to the UPG as a child of the plurality of objects; adding a join result structure to the UPG as a child of the merged join listener; propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node, the merged join listener combining comprising: for each current notification received at the merged join listener during the given update clock cycle: determining whether a priority queue includes an existing notification from the merged notification listener for the given clock cycle, the priority queue comprising a plurality of elements, each of the elements having an associated node of the UPG and an associated priority such that a first element with a first priority is served before a second element with a second priority lower than the first priority, the priority of each of the elements of the priority queue being determined based on an arrangement of nodes within the UPG, when the priority queue includes an existing notification from the merged notification listener for the given clock cycle, updating, responsive to the determining, the existing notification to include an additional notification based on the current notification, and when the priority queue does not include an existing notification from the merged notification listener for the given clock cycle, adding a new notification to the priority queue based on the current notification; and receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.
 2. The system of claim 1, wherein the plurality of objects are tables.
 3. The system of claim 2, wherein the join result structure is a table.
 4. The system of claim 1, wherein the UPG is a directed acyclic graph (DAG).
 5. The system of claim 1, wherein the plurality of objects consists of three or more objects.
 6. The system of claim 1, wherein the operations further include adding a second join result structure to the UPG as a second child of the merged join listener.
 7. The system of claim 6, wherein the operations further include receiving a second merged notification at the second join result structure's node of the UPG.
 8. The system of claim 1, wherein the join result structure comprises a plurality of matrices.
 9. The system of claim 1, wherein the join operation is an operation selected from a group consisting of an outer join, an inner join, and a cross join.
 10. A method comprising: receiving, at a query processor, a request to perform a join operation on a plurality of objects; creating, at the query processor, an update propagation graph (UPG) for propagating updates to a result of the join operation; adding a merged join listener to the UPG as a child of the plurality of objects; adding a join result to the UPG as a child of the merged join listener; propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result node, the propagating comprising processing notifications using a priority queue, the priority queue comprising a plurality of elements, each of the elements having an associated node of the UPG and an associated priority such that a first element with a first priority is served before a second element with a second priority lower than the first priority, the priority of each of the elements of the priority queue being determined based on an arrangement of nodes within the UPG; and receiving the merged notification at the join result node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.
 11. The method of claim 10, wherein the plurality of objects consists of a plurality of tables.
 12. The method of claim 10, wherein the UPG is a directed acyclic graph (DAG).
 13. The method of claim 10, wherein the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification comprises: receiving a notification during the given update clock cycle; determining that the priority queue includes an existing notification from the merged notification listener for the given clock cycle; updating, responsive to the determining, the existing notification to include an additional notification based on the received notification.
 14. The method of claim 10, wherein the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification comprises: receiving a notification during the given update clock cycle; determining that the priority queue does not include an existing notification from the merged notification listener for the given clock cycle; and adding, responsive to the determining, a new notification from the merged notification listener to the priority queue based on the received notification.
 15. The method of claim 14, wherein the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification further comprises: receiving a second notification during the given update clock cycle; determining that the priority queue includes an existing notification from the merged notification listener for the given clock cycle, the existing notification being the new notification; updating, responsive to the determining, the existing notification to include an additional notification based on the second notification.
 16. The method of claim 10, further comprising adding a second join result structure to the UPG as a second child of the merged join listener.
 17. The method of claim 16, further comprising receiving a second merged notification at the second join result structure's node of the UPG.
 18. The method of claim 17 wherein the second merged notification is a copy of the merged notification.
 19. The method of claim 10, wherein the join result structure comprises a plurality of matrices.
 20. The method of claim 10, wherein the join operation is an operation selected from a group consisting of an outer join, an inner join, and a cross join.
 21. A nontransitory computer readable medium having stored thereon software instructions that, when executed by one or more processors, cause the one or more processors to perform operations including: receiving a request to perform a join operation on a plurality of objects; creating an update propagation graph (UPG) for propagating updates to a result of the join operation; adding a merged join listener to the UPG as a child of the plurality of objects; adding a join result structure to the UPG as a child of the merged join listener; propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node, the propagating comprising processing notifications using a priority queue, the priority queue comprising a plurality of elements, each of the elements having an associated node of the UPG and an associated priority such that a first element with a first priority is served before a second element with a second priority lower than the first priority, the priority of each of the elements of the priority queue being determined based on an arrangement of nodes within the UPG; and receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.
 22. The nontransitory computer readable medium of claim 21, wherein the plurality of objects are tables and the join result structure is a table.
 23. The nontransitory computer readable medium of claim 21, wherein the UPG is a directed acyclic graph (DAG).
 24. The nontransitory computer readable medium of claim 21, wherein the plurality of objects consists of three or more objects.
 25. The nontransitory computer readable medium of claim 21, wherein the operations further include adding a second join result structure to the UPG as a second child of the merged join listener.
 26. The nontransitory computer readable medium of claim 21, wherein the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification comprising: for each current notification received at the merged join listener during the given update clock cycle: determining whether the priority queue includes an existing notification from the merged notification listener for the given clock cycle, when the priority queue includes an existing notification from the merged notification listener for the given clock cycle, updating, responsive to the determining, the existing notification to include an additional notification based on the current notification, and when the priority queue does not include an existing notification from the merged notification listener for the given clock cycle, adding a new notification to the priority queue based on the current notification.
 27. The nontransitory computer readable medium of claim 21, wherein the join result structure comprises a plurality of matrices.
 28. The nontransitory computer readable medium of claim 21, wherein the join operation is an operation selected from a group consisting of an outer join, an inner join, and a cross join.
 29. A system comprising: one or more hardware processors coupled to a nontransitory computer readable medium having stored thereon software instructions that, when executed by the one or more processors, cause the one or more processors to perform operations including: receiving a request to perform a join operation on a plurality of data sources; creating an update propagation graph (UPG) for propagating updates to a result of the join operation; adding a merged join listener to the UPG as a child of the plurality of objects; adding a join result structure to the UPG as a child of the merged join listener; propagating updates through the UPG, including the merged join listener combining update notifications from the plurality of objects for a given update clock cycle into a merged notification and delivering the merged notification to the join result structure's node of the UPG during that clock cycle to provide a consistent representation of updates to all sides of the join operation to the join result structure node, the propagating comprising processing notifications using a priority queue, the priority queue comprising a plurality of elements, each of the elements having an associated node of the UPG and an associated priority such that a first element with a first priority is served before a second element with a second priority lower than the first priority, the priority of each of the elements of the priority queue being determined based on an arrangement of nodes within the UPG; and receiving the merged notification at the join result structure node and applying changes to update the result of the join operation in a consistent manner without having to re-execute the full join operation.
 30. The system of claim 29, wherein the join operation is an operation selected from a group consisting of an outer join, an inner join, and a cross join. 